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ABSTRACT 

We present measurements of the void probability function (VPF) at z ~ 1 using data from the 
DEEP2 Redshift Survey and its evolution to z ~ using data from the Sloan Digital Sky Survey 
(SDSS). We measure the VPF as a function of galaxy color and luminosity in both surveys and 
find that it mimics trends displayed in the two-point correlation function, £; namely that samples of 
brighter, red galaxies have larger voids (i.e. are more strongly clustered) than fainter, blue galaxies. 
We also clearly detect evolution in the VPF with cosmic time, with voids being larger in comoving 
units at z ~ 0. We find that the reduced VPF matches the predictions of a 'negative binomial' model 
for galaxies of all colors, luminosities, and redshifts studied. This model lacks a physical motivation, 
but produces a simple analytic prediction for sources of any number density and integrated two-point 
correlation function, £. This implies that differences in the VPF across different galaxy populations are 
consistent with being due entirely to differences in the population number density and £. We compare 
the VPF at z ~ 1 to Y-body ACDM simulations and find good agreement between the DEEP2 data 
and mock galaxy catalogs. Interestingly, we find that the dark matter 'particle reduced VPF follows the 
physically motivated 'thermodynamic' model, while the dark matter halo reduced VPF more closely 
follows the negative binomial model. The robust result that all galaxy populations follow the negative 
binomial model appears to be due to primarily to the clustering of dark matter halos. The reduced 
VPF is insensitive to changes in the parameters of the halo occupation distribution, in the sense that 
halo models with the same £ will produce the same VPF. For the wide range of galaxies studied, the 
VPF therefore does not appear to provide useful constraints on galaxy evolution models that cannot 
be gleaned from studies of £ alone. 

Subject headings: galaxies: evolution — galaxies: dark matter — galaxies: clustering 



1. INTRODUCTION 

Voids are some of the most striking large-scale 
features of the Universe. Historically, their study 
can be loosely grouped into two categories: finding 
indivi d ual y oids or using a statistical approach (sec 
iRoodl (|1088f) for a detailed review of the history of 
void studies). The first focuses on identifying individ- 
ual voids with soph isti cated vo id-finding algorithms 
(iKairffmann fc Fairalll 119911 _ _ iKauffmann fc Melottj 
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- . lEl-Ad fc Piranl Il997t lAikio fc MaehoenerJ Il998t 
Sheth et all 12003 iPatiri et all 120051) that allow voids 
to have any convex shape. The properties of galaxies 
voids can then be studied, including their color 
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I2004TI analyze the recently-completed 



Roias^^dJ 
Hovlc & Vogelevl 
2dF Galaxy Redshift Survey and find that void galaxies 
are, on average, bluer and show evidence for more recent 
star-formation than the full 2dF galaxy population. 
Properties of voids themselves can also be studied, 
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including characteristic sizes, mean ellipticities, and 
radial density profiles. Many of these observational 
quantities, such as size measurements, can only be 
interpreted with carefully constructed mock galaxy 
catalogs. Extensive theoretical work has been carried 
out con cerning void size distributions and density 
profil es ll Sheth fc van de Weygaertl 12004 iBenson et al 



1 20031 iColberg et all 120051 iShandarin. Sheth. fc Sahn 
120041 IPatiri et all 20041) . void halo properties 
llAntonuc cio-Delogu et al] 120021: lOott lober et all 
1 2003 IGoldberg fc Vogeievl l200l IPatiri et all 120041: 
Colb erg et al.ll20 05fl . fo rmation of galaxies within voids 
(Mathis fc White! 120021) . and the properties of galaxies 
in voids l)Benson et al J 1200.3) . The question of whether 
simulations can produce voids as large and as empty as 
voids seen in the observed Universe remains unanswered, 
and can potentially provide po werful const raints on 
galaxy formation and cosmology l)Peeblesll200H) . 

The second approach to voids is somewhat more 
statistical and is in many ways complementary to 
the first. Studies of this type focus primarily on the 
void probability function (VPF) which is defined as 
the probability that a sphere of a given size centered 
on a random point in the survey volume contains no 
galaxies. Statistical void distributions in principle offer 
a wealth of information, as they can be related to the 
entire hierarchy of galaxy correlation functions (White 
1979). Yet, due to the difficulties in interpreting void 
statistics, they have not received as much attention as 
more conventional statistical measures of large-scale 
structure such as the two-point correlation function. 
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Despite these difficulties, the theoretical framework 
underlying void statistics has been developed in detail 
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Frvlll984lT985llT98ri lotto et alJll985 iFVvl 
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laSl lfibigna.etn.lJ IT991 IT99fil IRvden fc Mclott 
19961: iKauffmann. Nusser. fc Steinmetd 119971: 



Schmidt. Rvden. fc , Melqttl I 1200 It IBerlind fc Weinberg! 
2002: IBenson et al.11200^1 

Observationally, voids statistics have been investi- 
gated in almost every major galaxy redshift survey 
including the CfA dMaum gordato fc : Lachie ze-Revl 



Vogelev. Geller. fc Hncbral 1 1 99 It IVogelev et al.l iTflfll 
Mo fc Boernerl 11990ft. SSRS llGaztanaea fc Yokovamal 
199.11) . PSCz iHovle fc Vogelevl 12002ft. IRAS 



iEl-Ad. Piran. fc Dacostal 119971) . LCRS (jMuller et all 
12000ft and most recently, the 2dF survey 
llHovle fc Vc^elevll200l iCroton et al.ll2004l iPatiri et all 

2005). Results from the latter survey are representative 
of an emerging consensus; subsamplcs of brighter and/or 
redder galaxies contain larger voids than samples of 
fainter and/or bluer galaxies. This is interpreted as 
brighter and/or redder populations being more strongly 
clustered than fainter and/or bluer subsamples. These 
trends are also reflected in the two-poin t correlation 
funct ion of galaxies, £, bo th at low (e.g.. [Z ehavi et al.l 
12002ft and moderate (e.g., iCoil et all l2004bft redshifts. 
We note that some studies of voids have not taken 
into account the fact that galaxy populations have 
different number densities which will strongly affect 
void statistics; hence differences in the VPF across 
galaxy populations might as easily be attributable to the 
luminosity function as to varying clustering strengths. 
As brighter galaxies are rarer than fainter galaxies, even 
in volume-limited samples, brighter samples will have 
higher void probabilities. A critical question we wish 
to address here is the extent to which void statistics 
are governed by low-order clustering statistics. It is 
therefore important to separate the effects of clustering 
from the effects of number density on the VPF. 

Void statistics can also be used to probe the relation 
between galaxies and dark matter. There are strong the- 
oretical arguments that show that the clustering strength 
of dark mat ter and galaxi es should be different (e.g., 
| Kaiserlll984l iBardeen et al.lll986t lEfstathiou et al.lll98St 
ICole fc Kaiserl Il989t IMo fc White! 119961) . Galaxies ini- 
tially form in high density peaks of the dark matter 
distribution and hence are "biased" tracers of the dark 
matter at high redshift. Baryonic physics and cosmic 
evolution also lead one to expect that the biasing be- 
tween galaxies and dark matter should, in principle, be a 
function of scale, redshift, and galaxy properties such 
as color and luminosity. Many of these expectations 
have been borne out in observations (see for example 
I Davis fc Gelled 119761 I Lovedav et all 119951 IZehavi et all 
l2002UMadgwick et al.ll2003HCoil et al.ll2004bfl . 

By studying voids in observational samples and 
comparing to dark matter simulations, one can hope to 
probe the galaxy bias in a unique way. The VPF has 
previously been studied in simulations where a biasing 
prescription was assumed in order t o place galaxies in 
the d ark matter distribution (e.g.. ILittle fc Wei nberg 
19941 IKauffmann. Nusser. fc Stemmed Il997t 



Berliud fc W einberg! 12002! IGhigna et a.U 11994 [l996)' 



IKauffmann. Nusser. fc Steinmetzl (^997) find that the 
VPF has limited utility in probing the bias, as the 
relation between dark matter and galaxies changes as 
a function of the sampl ing d ensity of a galaxy s urvey . 
IWeinberg fc Cola] lfl99l and ILittle fc WeinheTel lTl99l 
however, find that the VPF can be a powerful discrim- 
inator between various biasing schemes, but that it is 
relatively insensitive to the value of the linear bias factor 
b = {igai/idm) 1 ^ 2 ■ For example, the VPF is very differ- 
ent, for fixed 6, when one statistically identifies galaxies 
with overdensities in the initial density field ('peaks 
biasing') versus identifying galaxies with overdensities 
in the final matter distribution ('density bia sing') . 

More recently, the halo model (e.g.. iSeliakl 2000; 
Peacock fc SmitH |200?i ICoorav fc Shetbl 12002 
Kravtsov et al.l 120041) has emerged as a useful pre- 
scription for placing galaxies within dark matter 
simulations. Instead of using a biasing scheme such as 
'peaks biasing' or 'density biasing' to relate galaxies 
to the dark matter distribution, the halo model places 
galaxies within virialized dark matter halos as a function 
of the halo mass. Although in principle the parameters 
of the halo model can evolve with cosmic time, there is 
evidence to suggest that the model does not strongly 
evolv e from z ~ 1 to z ~ llYan, Madgwick. fc Whita 
12003^ . IBerlind fc Weinberg! <|2002|) find that the VPF 
can be used to determine halo model parameters such 
as the minimum dark matter halo mass, M m i n , in which 
a galaxy of a given luminosity may exist, and can hence 
provide useful new constraints on the relation between 
galaxies and dark matter. In this paper we explore in 
detail the possibility of using the VPF to constrain the 
halo model. 

Until now, there has not been sufficient data to study 
the statistics of voids at intermediate redshift (z > 
0.3). From correlation func tion measurements using the 
DEEP2 dataset at z ~ 1 l|Coil et al.ll2004bft we know 
that galaxies were in general less strongly clustered in 
the past; one might expect this trend to be reflected in 
void statistics as well. As more data becomes available 
at higher redshifts, the importance of self-consistently in- 
vestigating the evolution of galaxy properties and statis- 
tics cannot be overemphasized. This entails using similar 
selection and analysis techniques at different redshifts. 
For this reason, we have chosen to include in this study an 
analysis of the SDSS, which now has data for ~ 200, 000 
galaxies at z < 0.2, to allow for robust conclusions con- 
cerning the evolution of void statistics. 

It should be kept in mind that the two approaches to 
the study of voids outlined above use the term "void" in 
very different ways. While the first approach identifies 
voids as large underdensitics in the galaxy distribution, 
allowing a void to assume any convex shape and allowing 
galaxies to exist inside voids, the second approach identi- 
fies voids as spherical regions in which surveyed galaxies 
are totally absent. Furthermore, while the first approach 
only locates "unique" voids, insofar as it does not allow 
smaller "sub-voids" to be contained within larger voids, 
the second approach allows for "voids" to overlap (see 
for a visualization of this idea) . The statistical approach 
is only interested in the question: what is the probability 
that a given volume element in the universe is empty? In 
this paper we are concerned with the second, statistical 
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approach, and hence we define voids in the latter sense. 

The rest of this paper is organized as follows. In 
wc outline the theory behind void statistics and point 
out several problems that arise with the theoretical un- 
derpinnings of voids, describes our methodology. We 
statistically analyze voids in mock galaxy catalogs built 
from N-body simulations in jJ3| in SjSJwe present our re- 
sults from the DEEP2 galaxy survey at z ~ 1 and in 
we analyze the SDSS galaxy survey at z ~ 0. ^discusses 
the implications and relevance of our results. Through- 
out this paper we assume a flat concordance ACDM cos- 
mology with f2 m = 0.3, f^A = 1— fim = 0-7 an d Hq = 100 
h km s _1 Mpc- 1 . 

2. THEORETICAL BACKGROUND 

2.1. General Considerations 

The void probability function (VPF) is defined as the 
probability of finding no galaxies inside a sphere of radius 
R, randomly placed within a sample. The most common 
theoretical interpretation of the VPF is as an infinite sum 
of the hierar chy of corre l ation functio ns of the galaxy 
distribution l|Whitelll979t lShethlll996|) . Specifically, for 
spherical volume elements one may write the VPF as: 



Pq(R) = exp 



E 



-N(R)p 



(i) 



where R is the sphere radius, N is the average number of 
galaxies within the sphere, and £ p is the volume averaged 
p-point correlation function, where the volume average is 
defined by 

i = (2) 

As Pq depends on a recurring factor of N , any mean- 
ingful comparison of P between populations requires a 
careful handling of their number densities. For many 
comparisons made in this study, we choose to remove 
the dependency on by randomly diluting the samples 
to have similar number densities. In this way we can 
isolate the effects of clustering on the VPF. 

The VPF takes a much simpler form if one uses the 
hierarchical Ansatz, 



t P = S P Z*- 1 ) P>3 



(3) 



to relate the hierarchy of correlation functions to the 
two-point function, £. This allows for a complete and 
relatively simple description of the entire cosmological 
density field. The Ansatz has been formally derived only 
for an f2 m = 1 universe with the addit ional assumptions 
of sta ble clustering and self-similarity l)Bernardeau et all 
2002), which are both known to be invalid. In the lin- 
ear regime (R > 15 Mpc) perturbation theory seems 
to validate the Ansatz, although this paper is largely 
confined to smaller scales. So far, the Ansatz has not 
been ruled out by obse rvations (see e.g. | Gaztanagalll992T: 
iGaztanaga et al.lll995t rBaugh ct al. 2004), although it is 
worth mentioning that Baugh et al. had to remove the 
largest structure in their sample in order to recover scale 
invariant S p values. There is no reason to believe that 
the Ansatz should hold in the quasi-linear to strongly 
nonlinear regimes. Although historically the S p values 



were assumed to be scale invariant, nothing in the for- 
malism developed below requires them to be. With the 
hierarchical Ansatz the VPF becomes: 



Po = exp 



LP=1 



-NP 



(4) 



iFrvl l)198fi() noted that Eqn. 0] could be manipulated to 
isolate the effects of the scaling coefficients (S p ). Fry 
defined the reduced void probability function, x' 



X = -ln(P )/N 
which, with the substitution of Eqn. 0] becomes: 



Sn 



P =i 



pi 



(5) 



(6) 



With this definition, N£ becomes the independent vari- 
able, and hence only populations of galaxies with differ- 
ent S p values will have different reduced VPFs. Although 
N£ will in general have a different dependence on R for 
each galaxy population, this quantity will always be an 
increasing function of R since N tx R 3 and £ oc R~~« 
with 7 < 3 for all of the populations considered here. 
We now briefly summarize several models which predict 
values for the S p values. 

2.2. Hierarchical Models 

As the dynamical equations governing gravitational 
clustering can not be solved in the weakly to strongly 
nonlinear regime using perturbation theory (or any of its 
offspring), various phenomenological models have been 
proposed to relate the higher order correlation functions 
to the two-point correlation function. These models pro- 
vide a complete description of gravitational clustering, 
yet each is inadequate either theoretically or observation- 
ally. See Fig. ^ for examples of predictions of the VPF 
from some of the more popul ar models; for a detailed 
treatment see|EY}J (|1986t I1988D . 

In a Poisson model, all moments with p > 1 vanish, 
and the VPF can be described simply by: 



Po 



-N 



x = i- 



(7) 



A Gaussian model is almost as simple and equally in- 
applicable to the statistics of large scale structure in the 
universe on nonlinear scales at late times. This model im- 
plies that the entire statistical distribution is described 
by the two-point moment (in this case £) and that all 
higher moments identically vanish. The Gaussian and 
Poisson models are obviously not actually hierarchical 
models; we include them under 'hierarchical models' for 
simplicity. 

More realistic models rely on the hierarchical Ansatz to 
specify all possible moments of the galaxy distribution. 
These models can be thought of as prescriptions for the 
S p scaling coefficients. The simplest model in this case 
forces S p = 1 for a ll p and is sensibly called the 'Minimal' 
model ()Frvl Il98fil) . Yet another model was motivated 
by an attempt to combine the hierarchic al Ansatz with 
the BBGKY kinetic equations l|Frvlll98^ . The BBGKY 
model becomes unphysical for large void radii 2 as can be 
seen in Fig. ^ where x becomes negative for A^£ > 30. 

The 'thermodynamic m odel' was first proposed by 
iSaslaw fc Hamilton! l)1984[) and arose from a theory which 
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Fig. 1. — Reduced VPF for several of the hierarchical models 
discussed in The models are most easily distinguished from 
one another for N£ > 5, a region that can be investigated with 
currently existing low- and high-redshift galaxy samples. Note that 
the Poisson model requires \ = 1. 



treated galaxy clustering by analogy to statistical me- 
chanics. I t wa s later extended into a more self-consistent 
model bv iFrv 



;198' 
X = [(1 + 2N0 



1/2 



(8) 



S p = (2p-3)! 



(9) 

Intriguingly, a model developed bv lShethl l|1998j) which 
combines an excursion set approach to the evolution of 
the halo mass function with a simple model for the spa- 
tial distr ibutio n of such halos predicts the same values for 



S p . iFrvl ((1985) points out that this theory is only strictly 
true for large volumes, and that the derivations do not 
apply for £ ~ 1 (r ~ 5 h' 1 Mpc). It is also difficult 
to understand how such large scales could have become 
thermodynamically re laxed over the age of the universe. 
lHamilton et al.1 <|1985f> compared this model prediction 
to the VPF from the CfA survey, but their results were 
inconclusive. They did find, however, that volume lim- 
ited subsamples of different minimum luminosities were 
hierarchically related in that samples with different N 
and £, had similar S p . 

The final model we consider has had a colorful history. 
The negative binomial distribution, also known as the 
modified Bose-Einstein di stribution, was used early on by 
iCarruthers fc S hih ( 1983) to study the count distribution 
of char ged hadrons resulting from high- energy collisions 
and bv ICarruthers fc Duong- van! l|1983f) to describe the 
observed distribution of Zwicky clusters. The model can 
be described by: 



X = In(l + NO/m, 



(p-l)l 



Po 



1 



(10) 

(11) 

(12) 



iGaztanaea fc Yokovamal l)1993|) analyzed the reduced 
VPF in the SSRS2 and CfA redshift surveys but could 



not discriminate between the thermodynamic and neg- 
ative binomial models because of the size of their er- 
rors. The negative binomial distribution was re-derived 
in their appendix by considering a sample divided into 
small cells, with the occupation probability of each cell 
depending only on £, and being independent of the other 
cells (see also lElizalde fc Gaztanaeal I1992J) . Unfortu- 
nately, the derivation contains little insight into the phys- 
ical mechanisms that might drive point distributions to 
become negative binomial. This model has also bee n de- 
ri yed from thermodyna mic a rguments (lShet b||1995l). 

IMo fc Boernerl 1)199(1) and iVogelev. Celler. fc Huchral 
( 1991) independently analyzed the CfA survey and found 
the data to be more consistent with the negative bino- 
mial than thermodynamic model over a range of lumi- 
nosity thresholds and morphological types, though the 
agreement be tween model and da ta was not conclusive. 
Most recently. ICroton et alJ l|2004|) found that galaxies in 
the 2dF survey follow the negative binomial model over 
a range of differential luminosity bins. 

2.3. Convergence Issues 

The astute reader will have noticed two problems with 
the theoretical models presented above, one related to 
convergence and the other to the unintuitive nature of 
the sums in Eqns. ^ 01 an d El ■ The form for the S p 
values associated with the thermodynamic and negative 
binomial models fEans. l9land lllf> can be understood as 
arising from a Taylor series expansion for the given x 
values in Eqns. 03 andjlUI Such an expansion is, how- 
ever, only valid for N£ < 1; the sum rapidly diverges if 
iV£ > 1. The conclusion must be that the S p values often 
quoted for these models cannot be correct for large iV£, 
i.e. for r > 3 h^ 1 Mpc (the precise r at which N£ = 1 
depends on the sample). 

Realizing this issue, we have explored other forms for 
the S p values such that the sums in Eqns. 0] and [6] do 
converge. Simplistic models such as: 



O - p Ap+B 

Or) C , 



(13) 



where A and B are free parameters, are consistent with 
recent observations of S p for p < 6 (Ba ueh et alJl2004|) 
and are also consistent with (p — 1)! for small p. Al- 
though the sums converge with these S p values, they do 
not converge to a hierarchical model; in fact, for many 
values of A and B, the sums converge to non-physical 
values (Po < 0). It is unclear how far one should pursue 
this, given (1) that the S p values are probably scale- 
dependent, and here we have assumed that they are not, 
(2) there is no theoretical reason to suspect that the 
higher-order moments should be simply proportional to 
a power of £, and (3) though mathematically valid, the 
expansion of Po m terms of £ p is not terribly useful in 
practice. 

To illustrate (3) consider the following. One might 
think that increasingly higher-order moments would be 
increasingly irrelevant for void statistics. In fact, we have 
explicitly computed the sum in Eqn. ^for the first eight 
correlation functions in large galaxy mock catalogs and 
find that the resulting partial sum in no way approxi- 
mates the VPF measured from the same mock catalogs. 
It seems that increasingly higher-order moments are, in 
fact, increasingly relevant. 
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In order to gain a clearer understanding of this we have 
investigated the sequence of partial sums of Eqn. fusing 
the S p values given in Eqn.^Il We find that the sequence 
of partial sums for Pq oscillates wildly for p < 20, but 
rapidly converges for p > 25. In other words, it is not 
the first few, but the first ~ 25 correlation functions that 
are necessary to accurately describe the void distribution. 
Though it is comforting to know that the sum eventually 
converges, it is quite puzzling why convergence should 
require so many correlation functions. 

One possible explanation for the importance of higher- 
order moments is the following: since there exist clusters 
with twenty objects or more, we might expect the 20- 
point correlation function to be non-zero. This function 
describes the probability, in excess of random, that a re- 
gion of space which contains 19 objects will contain a 
20th. Since this will almost never happen for a random 
sample, the 20-point function will likely be quite large, 
at least on small scales. Hence one might expect con- 
tributions from p-point correlation functions as long as 
there are clusters of objects with p members. 

3. METHODOLOGY 

We next describe our general methodology for obtain- 
ing void statistics from large samples of galaxies. Is- 
sues concerning specific survey details, such as the proper 
handling of angular window functions, are treated as they 
arise in §§ 5 and 6, where we use data from different sur- 
veys. 

Investigating the VPF and reduced VPF requires the 
measurement of three quantities: N, £, and Pq, all of 
which are functions of the sphere radius, R. Each of 
these quantities is straightforwardly determined by a 
counts-in-cells (CIC) approach. One simply places large 
numbers (~ 10 5 ) of random spheres within the survey 
and counts the number of galaxies contained within each 
sphere. This is then repeated for many sphere radii. N 
is the average number of galaxies in a sphere: 



N 



1 



Ntat 

■ 2> 



(14) 



where Aftot 5 is the total number of spheres placed and 
Ni is the number of galaxies in the ith sphere. Pq is the 
number of spheres containing zero galaxies divided by 
the total number of spheres, 



Pn 



Aftot 



(15) 



where A/o is the number of spheres that contain zero 
galaxies, and £ is the variance in the number of galaxies 
per sphere: 

^wzw^i (16) 

In the limit of large numbers of random points, the CIC 
approach for determining £ is known to be ma themati- 
cally equivalent to more conventional methods l|Szapudil 
1998). We independently confirm this result by compar- 
ing our CIC-measured £ to the volume-averaged corre- 
lation function obtained via the popular Landy-Szalay 

5 Here and throughout we use Af to refer to the number of spheres 
and N to refer to the number of galaxies. 




Fig. 2. — Convergence of measured quantities as a function 
of the number of randomly placed spheres (Aftot) for galaxies in 
our simulations with a sphere radius of 7 h -1 Mpc. The rapid 
convergence is due to the fact that the simulation volume is well- 
sampled once the number of test volumes exhausts the number 
of independent configurations, which for this simulation, at a test 
sphere of R = 7 /i -1 Mpc, occurs at Aftot ~ 1500. 



estimator l|Landv fc Szalavl H993). We find that the two 
approaches are entirely consistent. To test the sensitiv- 
ity of these measured quantities to the total number of 
spheres used, Mot, we have computed Pq, N, and £ us- 
ing a sphere radius of 7 h^ 1 Mpc, as a function of Aftot 
(Fig. EJ , for mock galaxies in a simulation box of length 
256 h~ x Mpc (see below for simulation details). As ex- 
pected, for small Aftot , these quantities are unstable, but 
for Aftot ^ 10 4 the quantities converge well. This rapid 
convergence holds for both smaller and larger void radii 
and is due to the fact that the simulation volume is well- 
sampled once the number of test volumes exhausts the 
number of independent configurations. To be conserva- 
tive, we use Aftot ~ 10 5 to calculate these quantities. 
Errors for all measured quantities are estimated by "jack- 
knife" sampling. 

Before moving on, we should highlight the fact that all 
quantities used in analyses of void statistics are spheri- 
cally averaged. This type of averaging effectively throws 
away much of the useful information contained in higher 
order statistics. For example, £3 measures the skewness 
of the distribution, and will in general not be spherically 
symmetric. Hence it is unclear whether or not spheri- 
cally averaging £3 is even the proper way to average £3; 
these ambiguities make interpretation difficult. 

The £ used here and in other void analyses refers to 
the correlation function in redshift space; the correlation 
function has not been de-projected into real space, which 
is the more conventional way of reporting £. This is 
important because, as we will see, the real space VPF 
measured in mock catalogs is different from the behav- 
ior of the VPF in redshift space, and our main conclu- 
sions turn out to be valid only in redshift space. Indeed, 
iVogelev et alJ (|1994f) find that, at fixed void radii, Pq in 
simulations is larger in redshift space compared to real 
space, with the difference increasing at larger void radii. 
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These authors also find that redshift space void statistics 
closely follows one or another hierarchical model (which 
model the statistics follow depends on gross cosmologi- 
cal models, i.e. closed v.s open and biased vs. unbiased), 
while the real space void statistics do not. One exam- 
ple of redshift-space effects relevant to this study is the 
flattening of the S p scaling coefficients in redshift space 
when compared to real space. This is due to the reduc- 
tion of clustering strength on small scales caused by pe- 
culiar velocities (known as the fingers-of-God effect) . See 
iBernardeau et all l)2002|) for examples of this and other 
redshift space effects. 

Finally, we require that the number density of the 
sample under consideration be independent of redshift, 
and so construct volume-limited samples for our analysis. 
Specifically, we require that each galaxy be observable 
over the entire redshift range we consider. 

4. VOID STATISTICS IN SIMULATIONS 

In this section we analyze void statistics in mock galaxy 
catalogs constructed from N-body ACDM simulations in 
order to help understand and interpret void statistics re- 
covered from observational samples. First we measure 
the VPF and reduced VPF for dark matter particles and 
mock galaxies at z ~ and z <~ 1. We then investigate 
the reduced VPF for different halo model halo occupation 
distributions (HODs) and find that they are similar over 
a wide range of HOD parameters. Next, we determine 
the effects of redshift space distortions on the VPF and 
finally wc compare VPFs for halo centers alone to VPFs 
for mock galaxies. Throughout this section we use full 
simulation boxes at two outputs: z = 0.087 ("z ~ 0") 
and z = 0.92 ("z ~ 1"); in |J2I where we investigate 
the effects of survey geometry on the VPF, we extract 
light-cone geometries from the full simulation. Except 
for H4.3.1I all analyses in this section are performed for 
galaxies, dark matter particles, and dark matter halos, 
in redshift space. 

4.1. The Simulations 

The mock galaxy catalogs we use were constructed 
specifically for the DEEP2 surve y. A complete de- 
script ion of the catalogs is given in lYan. White, fc Coill 
(2004); we give the relevant details here. N-body simula- 
tions of 512 3 dark matter particles with a particle mass 
m par t = 1.0 x 10 10 /i _1 M were run in a ACDM uni- 
verse using the TreePM code llWhitel l2002) in a periodic, 
cubical box of side length 256 h^ 1 Mpc. Dark matter 
halos were identified by running a "friends-of-friends" 
group finder. Galaxies were then inserted via a halo 
model approach, where galaxies are placed in dark mat- 
ter halos using a simple prescription (see e.g.|S eliak 2000; 
| Ma fc Frvl200alPeacock fe Smithl2000llCoorav fc Shethl 
120021 iKravtsov et alJl2004|) . 

The main ingredient of the halo model is the halo oc- 
cupation distribution, HOD, which specifies the mean 
number of galaxies to be placed in a given halo, as a 
function of the halo mass ((N(M))). In its most com- 
mon form the HOD is the sum of a power-law describing 
the sub-halo (satellite) population and a step function 
above some minimum mass, which describ es the host 
halo (central galaxy) ijKravtsov et al.ll2004|) . Luminosi- 
ties are then assigned to galaxies in a halo according to a 
conditional luminosity function (CLF), $(L|M), which 



specifies t he luminosity fun ction of galaxies in halos of 
mass M ifYang et al]l200fl . The CLF is traditionally 
chosen to have a Schechter form. The same HOD and 
CLF are used to populate halos at both high and low red- 
shift, with only the underlying dark matter distribution 
and the characteristic scale of the luminosity function, 
L*, evolving with redshift (evolution in L* was taken to 
be 1 mag based on COMBO-17 data iWolf et al]l2003h l. 
This "no evolution" hypothesis produces catalogs that 
are in agreement with two-point clustering measurements 
from DEEP2 at z ~ 1 and the Bj-band luminosity func- 
tion and two-point clu stering of 2dF galaxies at z ~ 
(jMadgwick et al.|l20 03). For what we call the 'primary' 
simulations, the specific parameters of the HOD and CLF 
are chosen to best match the two-point clustering mea- 
surements at z ~ 1. 

4.2. VPF of Dark Matter and Mock Galaxies 

This section uses the 'primary' simulations as de- 
scribed above. All samples drawn from simulations 
have been randomly diluted to the same number den- 
sity (n — 0.009 h 3 Mpc -3 ) to allow for a direct compar- 
ison between VPF measurements (recall that the VPF, 
as opposed to the reduced VPF, is quite sensitive to N). 
Furthermore, all galaxies in these samples are restricted 
to have luminosities -22 < Mb — 51og(h) < -19. Fig. El 
shows the VPF calculated for dark matter particles and 
galaxies at z <~ 1 and z ~ in redshift space, where er- 
rors are estimated using jackknife sampling. There are 
two obvious trends. The first is that, at both redshifts, 
galaxies have a larger Pq at all radii compared to dark 
matter, indicating that they are more highly clustered 
and that galaxies included here are biased relative to 
the dark matter. This simply results from the fact that 
these galaxies live in massive dark matter halos that are 
large overdensities and hence are more highly clustered 
than the overall dark matter distribution. This is not 
in conflict with the realization that at z ~ the galaxy 
distrib ution accurately t races the dark matter on large 
scales l|Verde et alJ 2002) because (1) we are probing dif- 
ferent scales and (2) these studies have only measured 
the linear and quadratic bias terms, whereas we are in 
principle sensitive to all non-linear biasing terms. The 
second result in Fig. |3 is that both galaxies and dark 
matter become more clustered with time, as seen by the 
larger voids at z ~ relative to z ~ 1; this is due to the 
effects of gravity. 

In Fig. 21 we show the reduced VPF separately for low- 
and high-rcdshift mock samples as well as for two hi- 
erarchical models, the negative binomial (dashed) and 
thermodynamic (dash-dot). Here as elsewhere, the up- 
per limit on N£ in the reduced VPF is set by the radius 
at which Po — 0; at that point x becomes undefined. 
The error on the mean in each point (computed by di- 
viding the cube into octants and measuring void statistics 
within each octant) is smaller than the plotted symbols 
(~ 0.001), although errors are highly covariant between 
different sphere radii. In principle there should be errors 
on JV£ in addition to \> as ^£ will vary across the oc- 
tants at a specific R. In practice however, the error on 
N£ is negligible as it is smaller than the binsize. 

The dark matter VPF is well described by the thermo- 
dynamic model at all scales for both z ~ 1 and z ~ 0, 
while the galaxy VPF fits the negative binomial model 
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Fig. 3. — VPF (Po) measured in mock galaxy catalogs and 
dark matter N-body simulations as a function of comoving radius 
at z ~ 1 and z ~ 0. It is apparent that dark matter at z ~ 
(asterisks) has larger voids, and hence is more clustered, than dark 
matter at z ~ 1 (triangles). Similarly, galaxies at z ~ (diamonds) 
are more clustered than galaxies at 2 ~ 1 (squares), and galaxies 
have more voids than dark matter particles at both epochs.. All 
samples shown here were randomly diluted to a number density of 
n = 0.009 h 3 Mpc — 3 , and the mock galaxies were chosen to have 
luminosities in the range —22 < Mb — 51og(h) < —19. 
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Fig. 4. — Reduced VPF measured in redshift space from simu- 
lations at z ~ (top) and z ~ 1 (bottom). At both redshifts the 
galaxies follow the negative binomial model (dashed line) while the 
dark matter follows the thermodynamic model (dot-dashed line). 
Since all samples have been diluted to the same number density, 
this implies that the trends seen in Fig.[3]are due solely to changes 
in the volume-averaged two-point correlation function. See text for 
details. 



to a precision better than the achievable observational 
errors. In fact, the dark matter matches the thermody- 
namic model well to at least z ~ 2, the highest redshift 
simulation output available. This directly shows that the 
VPF can be entirely parameterized in terms of two vari- 
ables, N and £, since, for a given hierarchical model, \ 
is a function of only of their product (see Eqn. IT2|) . The 
differences seen in the VPF for galaxies at low and high 
redshift (Fig. 0J) are described entirely by the evolution of 
£ with cosmic time. Below we find that this dependence 
holds in the data as well, not only as a function of red- 
shift but also for sampl es with varying galax y properties 
at the same redshifts. iCroton et al.1 1)20041 ) do not find 
this good agreement between the thermodynamic model 
and dark matter particles drawn from the Hubble Volume 
simulations. We speculate that the much larger particle 
mass (nipart ~ 10 12 /i _1 M Q ) of these simulations does 
not afford an accurate measurement of the dark matter 
particle reduced VPF. 

There is a small but perceptible up-turn in the reduced 
VPF for all samples in Fig.^lat the largest N£ measured. 
A much larger up-turn has been seen in the CfA sur vey 
llVogelev. Geller. & HuchralT^ll f\^gelev et al.lll99l at 
similar void radii (7 — 15 h^ 1 Mpc, depending on the 
luminosity threshold). This may be due to fluctuations 
in the initial conditions of the simulations, as both the 
galaxies and dark matter at low and high redshift were 
derived from the same simulation. Furthermore, these 
large scales (corresponding to void diameters of ~ 30 
h^ 1 Mpc) are approaching the limit at which a simu- 
lation with box size of L — 256 3 h~ 3 Mpc 3 becomes 
unreliable due to the fact that large-scale modes, which 
become increasingly important on larger scales, cannot 
be included in simulations with periodic boundary con- 
ditions. Future studies with different realizations of the 
initial conditions and larger box sizes will be able to ad- 
dress the significance of this deviation. Regardless, we 
are unable to probe these largest void scales with the 
currently available galaxy surveys. 

Finally, we find that the good agreement between the 
reduced VPF in simulations and the negative binomial 
model is insensitive to random dilutions of the mock 
galaxies. Specifically, we randomly dilute the mock 
galaxy catalogs to 25%, 50%, and 75% of the nominal 
number density and find no differences in the resulting 
reduced VPFs, as expected since removing dependencies 
on N was a major reason for introducing the reduced 
VPF in the first place. Defining the VPF as a sum of 
higher order correlation functions and then inserting the 
hierarchical Ansatz provides a simple explanation for this 
insensitivity of the reduced VPF on N. 

4.3. Constraining the Halo Model 

As mentioned above, the halo model can be very useful 
for placing galaxies in cosmological dark matter simula- 
tions. It also provides a way of understanding the VPF 
without resorting to an infinite sum of higher-order cor- 
relation functions. Specifically, 
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where S rll is the mass density contrast smoothed over a 
volume V, P{5fa) is the probability of having a given 
smoothed mass density contrast, and P(M < M m i n \S m ) 
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Fig. 5. — Top: Various HODs used to populate galaxies in 
dark matter halos in simulations. Bottom: The resulting reduced 
VPF, in redshift space, for the simulations with the HODs shown 
in the top panel, plotted as differences from the negative binomial 
model. The only HOD inconsistent with the negative binomial 
model, model Bl, is easily ruled out observationally by its two- 
point correlation function. The reduced VPF is almost entirely 
insensitive to the number of galaxies placed in a halo as a function 
of galaxy mass. Furthermore, any changes seen in the (un-reduced) 
VPF can be solely attributed to changes in £. Hence studying the 
VPF seems to provide no new constraints on the HOD beyond the 
information available in §. 



is the probability of finding a halo with mass less than 
M m i n in a volume V with smoothed density contrast 
S m . Here we have written Pq(V) explicitly as a func- 
tion of the smoothing volume, but note that it is the 
same Po that occurs elsewhere. The only input from the 
halo model is the value for M m i n , the mass above which 
halos contribute to P . P{8 in ) is variously assumed in 
the literature to have a lognormal, negative binomial, or 
Gaussian distribution, and P(M < M m i n \8 m ) is fixed 
by the relation between halos and dark matter (the halo 
bias), which can be determined from simulations. 

We would like to test the ability of the VPF 
to constrain the halo model, as suggested by 
iBerlind fc Weinberg! (2002). Their main results are (1) 
that the VPF is sensitive to M m j n , the minimum mass 
that a dark matter halo must have to host a galaxy, and 
(2) that the VPF is entirely insensitive to the spatial 
distribution of galaxies within halos. 

To investigate these claims we calculate void statistics, 
in redshift space, for mock galaxy catalogs constructed 
from various HODs. The top panel in Fig. El shows five 
differ ent HODs with the simple form ijKravtsov et al.l 
l200l : 
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25.0,7.9,4.9,4.1 and 3.9 (in 



units of 10 11 ft." 1 Mq). For each M m i„, Mi is determined 
by fixing the overall number density at 0.01 h 3 Mpc -3 . 
These five models are referred to as Bl, B3, B5 ,B7, and 
B9, respectively. 

In order to clearly show small variations between the 
models, in Fig. El (bottom panel) we plot differences be- 
tween the negative binomial model and the measured 
reduced VPF for each galaxy catalog. The reduced VPF 
remains essentially unchanged for these different HODs, 
leading us to conclude that the VPF can be entirely de- 
termined by £, i.e., by Eqn. The scatter between 
the five models is ~ 0.02, much smaller than the at- 
tainable accuracy with even the largest current galaxy 
surveys; we therefore conclude that over a wide range of 
physically meaningful HODs, the VPF cannot constrain 
M min beyond constraints attainable from £ alone. In the 
figure, the reduced VPF for the Bl HOD is the most 
deviant from the negative binomial model. However, ob- 
servations of the two-point correlation function at z ~ 1 
(|Coil et al.H2005fl rule this model out. 

These results are con sistent with the claims of 
IBerlind fc Weinberg! l)2002j) . as they were concerned with 
void statistics in real space. The halo model formal- 
ism becomes much more complex analytically in redshift 
space, due to the presence of peculiar velocities. We 
find using simulations, that redshift-space effects essen- 
tially 'wash out' any differences in the VPF due to differ- 
ent HODs. Qualitatively, one would expect that placing 
more galaxies in halos would increase the S p values on 
small scales in real space. However, in redshift space, 
small-scale power is more smeared out because there are 
now more galaxies residing in massive halos with higher 
velocity dispersions. It is only a surprise that these two 
effects should act to cancel each other out exactly. We 
now investigate the effects of redshift-space distortions 
on void statistics in detail. 

4.3.1. Redshift Space Effects 

The effects of redshift space distortions on clustering, 
statistics have been studied in detail (see e.g.lLahav et alJ 
119931 lWhitdl200lt lSelialdl200lt lScoccimarroll2004fl. and' 
have been probed with void st atistics iVoeelev et all 
Il99l IRvden fc Melottl Il99fih . IVogelev et alJ IU994H 
found that voids appear somewhat larger in redshift 
space and that the reduced VPF in real space does not 
agree with any of the hierarchical models. 

We explore the extent to which redshift-space effects 
wash away information by calculating the VPF result- 
ing from several HODs with fixed M m i n and varying a 
(for 0.5 < a < 1.0), i.e. varying the number of galaxies 
placed in halos with mass above M m i n , while still keep- 
ing the overall number density fixed at 0.01 h 3 Mpc -3 . 
One might imagine that, by fixing M m in and varying a, 
Po would remain unchanged, but £ would vary because 
adding more galaxies to a halo will strongly affect the 
small scale clustering. If this were true, then Po would 
not simply be a function of £ alone. By analyzing a suite 
of mock galaxy catalogs constructed with these various 
HODs, we find, as expected, that Po remains unchanged. 
Yet surprisingly, but in accordance with the idea that 
redshift-space distortions conspire to 'wash out' differ- 
ences due to different HODs, £ remains unchanged for 
R > 1 /i -1 Mpc and is only mildly different for R < 1 
Mpc. By showing that £ remains the same across differ- 
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Fig. 6. — Effects of calculating void statistics in real space vs. 
redshift space. Top: Ratio of the VPF, Poi in redshift space to real 
space. There are more voids on large scales in redshift space be- 
cause of coherent galaxy infall into groups and filaments. Bottom: 
Reduced VPF in real space for mock galaxy catalogs, plotted as 
differences between the mock galaxies and the negative binomial 
model. Unlike the reduced VPF in redshift space, here there are 
trends for different HOD parameters; smaller values of M m i n agree 
less well with the negative binomial model than larger values. 



cnt HODs when Pq and N remain the same, we further 
solidify our claim that Pq is solely a function of Nxi, 
and hence that Pq does not contain additional informa- 
tion beyond what is available in £. 

To quantify the differences between real and redshift 
space we calculate void statistics for our five HOD models 
(B1-B9) in real space and compare them to their redshift 
space analogs. The top panel in Fig. [5] displays the frac- 
tional probability increase of the VPF when comparing 
redshift space (Po(z)) to real space (Po(r)). Two com- 
peting effects are at work here: (1) peculiar velocities 
smear out small scale clustering, and (2) Kaiser infall, 
due to the co herent infall of structures on larger scales 
l|Kaiserlll987j) , which has the effect of increasing the size 
of voids on large scales in redshift space. 

The general trends in this figure are consistent with the 
underlying HOD of each sample. For example, model Bl 
preferentially places galaxies in higher-mass halos com- 
pared to other HODs, and more massive halos have larger 
velocity dispersions. The larger dispersions decrease the 
size of voids as seen in redshift space, explaining why 
there are fewer voids in redshift space compared to real 
space for the Bl model. This model also has the smallest 
differences on large scales. This is in part because veloc- 
ity dispersions are small compared to the largest voids, 
but also because of Kaiser infall, which increases the ap- 
parent size of voids in redshift space on large scales, is 
important primarily for lower mass halos that are falling 
into forming structures. Since M m i n is so large in the Bl 
model, we expect that most halos in this model are not 
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mial model. It is primarily the clustering properties of dark matter 
halos, not the galaxies within them, that generates agreement with 
the negative binomial model. 



falling onto forming superstructures. 

In the bottom panel of Fig. we see that the reduced 
VPF can in fact uniquely constrain the halo model when 
the analysis is performed in real space. The VPF for 
a set of catalogs can be entirely described by a single 
function /(£,7V), and hence cannot be used to uniquely 
constrain models, if the reduced VPF for those models 
are all the same. Yet for our five HOD models, this is 
not the case in real space. Instead, there is a definite 
trend of x with M min ; samples with smaller M min are 
less like the negative binomial model. On larger scales 
(~ 5 h^ 1 Mpc) the HODs are again indistinguishable, 
as is expected since HOD parameters are relevant only 
on halo scales (i.e. ~ 1 hr 1 Mpc). Although it seems 
intuitive that the VPF should be capable of uniquely 
constraining the HOD (specifically M m j„), redshift-space 
effects make this extremely difficult in practice. 

4.3.2. VPF for Dark Matter Halos 

We can better understand the insensitivity of the VPF 
to particular parameters of the HOD by considering the 
VPF for the centers of dark matter halos (restricted to 
have Mhaio > M min ) alone. Fig. [7| compares the reduced 
VPF for dark matter particles, dark matter halo cen- 
ters, and mock galaxies. As the figure indicates, it is 
primarily the clustering properties of dark matter halos, 
not the number of galaxies within them, that generates 
agreement with the negative binomial model. 

We find the similarity in reduced VPF for halo cen- 
ters and mock galaxies intuitive for two reasons. First, 
as most dark matter halos in our simulations with M > 
M m i n contain a single isolated galaxy, and galaxies in 
most halo models are placed first in the center of the 
halo, it is reasonable that the clustering of halo centers 
has more impact on the galaxy reduced VPF than a. 
Second, when more than one galaxy is placed in a halo, 
it is typically within ~ 500 h^ 1 kpc of the halo center, 
as very few halos have larger radii. Hence a sphere con- 
taining a satellite galaxy will typically already include 
the galaxy at the halo's center. This also explains why 
void statistics are insensitive to th e spatial distribution 
of gala xies within halos, as noted bv lBerlind &; Weinberg! 
(2001. 
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In conclusion, the VPF does not seem to be capable 
of providing competitive constraints on the major pa- 
rameters of the HOD. This is due both to redshift space 
distortions washing out information on small scales and 
to the fact that the clustering of dark matter halos alone 
dominates the VPF. Although one might have thought 
that the VPF for galaxies and halos would be uniquely 
sensitive to M m i n , it turns out that any changes in M m i n 
are likewise reflected in £, in a way that can exactly ac- 
count for changes in the VPF. 

5. VOID STATISTICS FROM DEEP2 

We now measure void statistics in state-of-the-art red- 
shift surveys that are mapping the three-dimensional po- 
sitions of thousands of galaxies. In this section we probe 
void statistics when the universe was roughly half its 
present age (z ~ 1). In the following section we focus 
on voids in the local universe (z ~ 0). 

5.1. The Data 

Th e DEEP2 Galaxy Redshift Survey ijDavis et all 
120041) is an ongoing project that is gathering optical spec- 
tra for ~ 50, 000 galaxies at z ~ 1 using the DEIMOS 
spectrograph on the Keck II 10-m telescope. The com- 
pleted survey will span a comoving volume of ~ 10 6 /i -3 
Mpc 3 , covering 3 deg 2 over four widely separated fields; 
observations began July 2002 and are expected to be 
completed in mid-2006. Due to the high dispersion and 
excellent sky subtraction provided by the DEIMOS spec- 
trograph (R ~ 5,000), our rms redshift errors, deter- 
mined from repeated observations, are ~ 30km s , mi- 
nuscule compared to void scales. 

Target galaxies are selected using BRI imaging from 
the CFHT telescope down to a limiting magnitude of 
R = 24.1 (all magnitudes in this paper are in the AB 
system; see Coil et al. 2004 for photometric details). 
In three of the four fields we also use apparent colors 
to exclude objects likely to have z < 0.7. This pre- 
selection greatly enhan ces our efficiency for targeting 
galaxies at high redshift ijNewman et al.l l2005) . A fourth 
field, the Extended Groth Strip (EGS), has no redshift 
pre-selection, and is not used in the present analysis. 
For galaxies with a successfully identified redshift, abso- 
lute i?-band magnitudes (Mb) and restfr ame U — B col- 
ors, d enoted (U — B)o, have been derived l|Willmer et al.1 
2005). In the discussion below, we define "red" and 
"blue" galaxies according to whether (U — B)q > 1 or 
(U — B) < 1; this roughly corresponds to the saddle 
point of the co lor bimodality observed in DEEP2 data 
ijWillmer et~aTI 12005(1 . The current study uses 12,000 
rcdshifts with 0.75 > z < 1.0 in three fields covering 
~ 2.2 deg 2 . The three fields we use correspond to the 
DEEP2 poin tings 21, part of 2 2, 31, 32, 33, 41, and 42, 
as defined in lCoil et alJ l|2004a|) . 

An important aspect in analysis of any large-scale 
galaxy survey is the proper handling of selection effects 
that may vary as a function of galaxy property, such 
as color and luminosity. DEEP2 is an i?-band limited 
survey, which corresponds to restframe UV at z > 1, 
and results in fewer red galaxies be ing targeted at highe r 
redshift compared to blue galaxies i fWillmer et al.1 2005'). 
Due to this selection effect and the generally lower sam- 
pling density beyond z ~ 1, we limit our analysis in this 
study to z < 1. 
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— 19 and a number density of 0.006 Mpc~ 3 . Po values are for 
galaxies in the full simulation box at z ~ 1 (stars), galaxies in a 
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the plot more readable; each set of points is based on spheres of 
the same radius. Error bars are obtained from averaging over three 
mock light cones, which correspond to the three separate DEEP2 
fields. 



5.1.1. Survey Geometry and Completeness 

Each DEEP 2 field is much longer in the redshift direc- 
tion than on the sky; the 1 — 2 x 0.5 deg 2 fields used for 
this work span 40 — 80 x ^20 h~ x Mpc in transverse co- 
moving extent, while the range 0.7 < z < 1.0 corresponds 
to 560 hr 1 Mpc comoving in the redshift direction. The 
number of possible independent spheres contained within 
the survey, as indicated by V survey /V sp here, varies from 
~ 10 5 for 1 h- 1 Mpc spheres to ~ 10 3 for 7 h^ 1 Mpc 
spheres. 

One might be concerned that the survey geometry 
would skew void statistics when probing voids with di- 
ameters comparable to the short dimension of the survey. 
We have tested this effect by comparing the VPFs from 
a full mock galaxy simulation box at z *~ 1, constructed 
from the 'primary' simulation (see tJ4.1|l . to mock galaxy 
catalogs with a "lightcone" geometry similar to DEEP2. 
The lightcone geometry is constructed by stacking to- 
gether several simulation box outputs and slicing through 
them diagonally, so as not to pick up the same structures 
at different times. 

We find that, even at the largest radii tested , the VPF 
in the mock lightcone and larger simulation box agree to 
within 1(T (Fig. [HJ). Of course, due to the decrease in the 
number of independent volume elements at large void 
radii, cosmic variance increases, and hence the scatter 
(computed across three mock light cones) increases. We 
conclude that the geometry of the DEEP2 survey allows 
us to accurately probe voids to radii of at least 7 h^ 1 
Mpc comoving. 

Another selection effect is the survey completeness. 
The DEEP2 survey spectroscopically targets ~60% of 
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objects that pass the apparent magnitude and color cuts 
mentioned above. Of those targeted objects, we are able 
to secure redshifts for > 70%. Follow-up observations 
have shown that ~ 10% of the targets are objects at 
z > 1.5 (C. Steidel, private communication). We there- 
fore have successful redshifts for ~ 50% of all galaxies 
at 0.75 < z < 1.0 in the surveyed fields with apparent 
magnitude of R < 24.1. This sampling rate is effectively 
scaled out when we dilute the mock galaxy catalogs to 
compare to the DEEP2 data. Further complications arise 
because the sampling rate is non-uniform due to the ne- 
cessities of slitmask design. Spectra of objects are not 
allowed to overlap on the CCD, such that objects that 
lie near each other in the direction on the sky that maps 
to the wavelength direction on the CCD can not be si- 
multaneously observed; this results in modest (< 10%) 
under-sampling in regions with the highest density of 
targets on the plane of the sky (each point on the sky 
is covered by multiple DEEP2 slitmasks to ameliorate 
this effect). We model this effect by applying the ac- 
tual DEEP2 maskmaking algorithm to the mock galaxy 
catalogs and then computing the VPF. We find that the 
impact of the target selection algorithm on the VPF is 
negligible to within la (see Fig.[SJ|. One would have ex- 
pected this, as the effect is most relevant on small scales 
and in overdense regions. 

There are also selection effects due to bright stars and 
other regions that were not observed. These result in 
inhomogeneities on much larger scales (> 1 Mpc) that 
could potentially be more relevant for void studies. To 
take account of these effects for the DEEP2 sample, we 
generate an angular window function that has, for each 
right ascension and declination, the completeness at that 
point determined from the bad-pixel masks used in mak- 
ing our photometric catalogs and the observed redshift 
success for slitmasks covering that point. This window 
function therefore masks unobservable regions such as 
areas around bright stars (they are given a complete- 
ness of 0.0). We then convolve this window function 
with a circular kernel proportional to the path length 
through a sphere of radius R projected on the sky (i.e., 
K(Aa, AS) tx \/ p 2 — (Aa) 2 — (AS) 2 , where p is the pro- 
jected radius of the sphere in arcseconds, and Aa and 
A5 are the separations on the sky from the center of the 
kernel in the right-ascension and declination directions, 
respectively) . 

This convolution has a simple physical consequence: 
we treat a test sphere completely inside the survey vol- 
ume at a region of 50% completeness as equivalent to 
a sphere only 50% in the survey volume at a region of 
100% completeness. Our final product is a window func- 
tion that contains, at each point, the projected complete- 
ness averaged over a sphere of a given radius, weighting 
by the path length through the sphere. We then throw 
down random spheres only at points above a minimum 
convolved completeness. This allows us to robustly avoid 
regions of bright stars, regions of low completeness (due, 
for example, to bad weather during observations) and the 
edges of the survey. For DEEP2, we set the complete- 
ness threshold at 55% so that the allowable completeness 
range varies over the survey by < 10%. It is more im- 
portant for our purposes to be uniformly complete than 
highly complete, as an overall dilution of the sample does 
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Fig. 9. — Schematic demonstration of the statistical nature of 
the VPF. The top panel shows the comoving projected positions for 
galaxies in a section of one DEEP2 field. The suppressed dimension 
is ~ 10 h^ 1 Mpc thick. The bottom panel shows the empty spheres 
found with a search radius of 6 h Mpc. We show a very small 
fraction of the actual number of empty spheres detected at this 
radius for clarity. 



not effect the reduced VPF. 

We test the accuracy of this method by dividing the 
measured average number of galaxies in a sphere of radius 
R, N, by the sphere volume for each radius. We find 
that this inferred number density is constant with sphere 
radius, providing good evidence that the larger spheres 
do not tend to lay farther outside the survey geometry 
than smaller spheres. Finally, we have spot-checked by 
eye the locations of the largest voids to ensure that they 
fall within the survey geometry (see Fig.^J. 

5.2. Results at z ~ 1 

We investigate the VPF and reduced VPF for DEEP2 
galaxies in three ways. First we consider an "overall" 
sample consisting of all galaxies with —22 < Mb — 
51og(h) < —19 and 0.75 < z < 1.0. Then we compute the 
VPF as a function of galaxy color and of luminosity, us- 
ing volume-limited samples. Errors for all DEEP2 mea- 
surements were derived from jackknife sampling using 
subvolumes of the three widely separated DEEP2 fields. 

We find excellent agreement between the overall 
DEEP2 sample VPF and the mock galaxy catalog VPF 
fFig. 1101 top panel). The mock catalogs were randomly 
diluted to have the same number density (n = 0.006 
h 3 Mpc~ 3 ) as the DEEP2 overall sample. The dashed 
line is a prediction of the negative binomial model. We 
show the reduced VPF, plotted as differences from the 
negative binomial prediction, for the overall sample in 
the top panel of Fig. ^2 top panel. The good agreement 
between the data and the model implies that the VPF 
can be described entirely by N and £. The largest de- 
viations on large scales seen here and below are of low 
statistical significance; the data are highly covariant from 
bin to bin, and hence our errors are underestimates of the 
true error. The excellent agreement between the overall 
sample and mock galaxy VPF reflects the fact that the 
mock galaxy catalogs were constructed to match the ob- 
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Fig. 10. — Top: Po for the overall DEEP2 sample compared to 
mock galaxy catalogs. The mock catalogs have an identical geom- 
etry as the DEEP2 survey volume and have been passed through 
the DEEP2 target selection algorithm. Both samples have the 
same number density (n = 0.006 h A Mpc -3 ). The good agree- 
ment between the mock catalogs and the data, although on the 
surface encouraging, is due entirely to the fact that the mock cat- 
alogs were constructed to match the DEEP2 two-point correlation 
function (see text for details). Bottom: DEEP2 Po separately for 
red and blue galaxies, both diluted to n = 0.002 h 3 Mpc -3 . The 
dilution removes any number density effect on the VPF, implying 
that we are only seeing effects of clustering on the VPFs shown 
here. Lines are predictions of the negative binomial model. 



served £ in the DEEP2 data. Stated differently, since 
these samples have the same number density (by con- 
struction), the fact that they have similar VPFs implies 
that they will have similar £ values, and vice versa. 

We next divide the DEEP2 sample into galaxies with 
(U — B)o < 1 or > 1, roughly matching the observed 
saddle point in the color bi-modality. We find that sam- 
ples of red galaxies have more and/or larger voids than 
samples of blue galaxies (Fig. ^3 bottom panel; both 
samples were randomly diluted to n = 0.002 /i 3 Mpc -3 ). 
This dilution is critical; without it, the differences be- 
tween the red and blue galaxy VPF would be dominated 
by differences in the observed number density of red ver- 
sus blue galaxies. It would be very difficult to separate 
the effects of number density from clustering strength on 
the number and size of voids if we simply compared the 
VPFs of undiluted samples. 

As with the overall sample, we find that the reduced 
VPF of both the blue and red galaxy populations follows 
the negative binomial model within the errors (Fig. 1111 
top panel). Again, this implies that the differences be- 
tween the VPFs for blue and red galaxies (Fig. 1101 bot- 
tom panel) are solely due to differences in their two-point 
correlation functions. The similarity in reduced VPFs is 
somewhat surprising. We know that blue and red galax- 
ies are biased differently relative to the dark matter (e.g., 
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Fig. 11.— Reduced VPF for DEEP2 galaxies, plotted as dif- 
ferences between the negative binomial model and the data. Top: 
Differences for the overall, red, and blue samples. The data are 
consistent with the model. Bottom: Differences for the DEEP2 
sample in differential magnitude bins. Again there is good agree- 
ment between the data and the model. The deviations on the 
largest scales are likely insignificant due to the large covariance 
between bins, which results in an underestimate of the true error. 



iZehavi et all2lMICoii"et all2004bll 2005. and references 
therein), and that the S p values are dependent on the 
bias. We might have expected the reduced VPFs, which 
according to Eqn. should depend on the S p values, to 
be different for these two populations. 

The halo model affords a more direct interpretation of 
these results. We have seen in S I4.3l th.at the reduced VPF 
is insensitive to parameters of the HOD. From studies of 
galaxy correlations at z ~ 0, it is becoming apparent that 
different populations of galaxies can have very different 
HODs. Hence the result that red and blue galaxies have 
similar reduced VPFs is consistent with our tests using 
mock catalogs; very different HODs will still produce the 
same reduced VPF. 

We also investigate the DEEP2 VPF dividing the sam- 
ple into three luminosity bins: —20 < Mb < —19, 
-21 < M B < -20, and -22 < M B < -21. We take 
a slightly different approach for this analysis. Since the 
number densities of the three luminosity subsamples vary 
by roughly a factor of four, the dilution required for 
a direct comparison of VPFs can discard a great deal 
of information and unnecessarily increases Poisson er- 
rors. For this reason, we begin by analyzing the reduced 
VPF for each luminosity sample (Fig. ^2 bottom panel), 
again finding good agreement with the negative binomial 
model. Using this model, we can then use Eqn.^Jto pre- 
dict_the VPF at any radius based solely on our knowledge 
of N and f at that radius. In Fig. U3 we plot the VPF 
for each of the three magnitude bins (without diluting 
the number density) and also show the predictions based 
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Fig. 12. — Plot of Pq for DEEP2 galaxies as a function of lumi- 
nosity. The competing effects due to the number density and clus- 
tering strengths drive the differences between subsamples. Given 
that the reduced VPF for these subsampl es c an be characterized 
by the negative binomial model (see Fig. 1111 . we predict Pq for 
each sample using only the number density and volume-averaged 
two-point correlation function measured within spheres of the given 
radius (lines). The excellent agreement seen here between the d ata 
and the predictions simply reflects the agreement found in Fig. 1111 
{bottom). 



on the negative binomial model; again, the agreement is 
within the errors. Knowledge of the form of the reduced 
VPF allows us to separate the effects of number density 
from clustering on the VPF. 

We note that the three magnitude bins cover slightly 
different redshift intervals. While the faintest bin (—20 < 
M B -51og(h) < -19) spans 0.7 < z < 0.85, the brightest 
bin (-22 < M B - 51og(h) < -21) spans 0.83 < z < 1.05. 
Each redshift interval is chosen such that the sample be- 
ing considered should have a roughly constant number 
density as a function of redshift. This ensures that there 
is no artificial redshift dependence in the void distribu- 
tion; i.e., the samples are volume limited. The less lumi- 
nous galaxies have a constant number density only over 
a limited redshift range because they have R > 24.1 at 
higher rcdshifts. 

The reduced VPF of all populations of galaxies at z ~ 1 
is well fit by the negative binomial model. Hence the 
VPF, a statistic that in principle relies on the infinite hi- 
erarchy of correlation functions, can in fact be accurately 
described solely by the number density and the volume- 
averaged two-point correlation function of the sample. 
We conclude that, at z ~ 1, the VPF provides no con- 
straints on either the halo model or on cosmological pa- 
rameters that cannot be gleaned from studies of correla- 
tion statistics. 

6. VOID STATISTICS FROM SDSS 

6.1. The Data 

The Sloan Digital Sk y Survey (SDSS;|Yori[eFaDIIS 
lAbazaiian et alJl2004tl is an extensive photometric and 
spectroscopic survey of the local universe. Imaging data 
exist over 10 4 deg 2 in five bandpasses, u,g,r,i, and z. We 
use color conversions provided by M. Blanton (private 



communication) to derive B-band magnitudes in the AB 
system, with typical 1-a errors in the conversion of 0.2 
magnitudes. Approximately 10 6 objects are being tar- 
geted for follow-up spectroscopy as part of the SDSS; 
most spectroscopic targets are brighter than r = 17.77 
l)Strauss et al.ll2002[) . Automated software performs all 
the necessary data reduction, including the assignment 
of redshifts. Redshift errors are ~ 30 kms -1 , similar 
to DEEP2 . The spectro graph tiling algorithm ensures 
nearly complete sampling ( Blan ton et al.ll2003a|) , yet the 
survey is not 100% complete due to several effects: (1) 
fiber collisions that do not allow objects separated by 
< 1' to be simultaneously targeted, affecting ~ 6% of 
targetable objects, (2) a small fraction (< 1%) of tar- 
geted galaxies fail to yield a reliable redshift, and (3) 
bright Galactic stars block small regions of the sky. Un- 
like the DEEP2 survey, (3) is not as important for SDSS 
because a bright star will block out a much smaller co- 
moving volume at z ~ compared to z ~ 1. The first 
of these effects is only important on small scales, and is 
likely negligible for void statistics since an undersampled, 
intrinsically high density region will not be counted as a 
void. 

For this analysis we make use of t he hybrid NYU Valu e 
Added Galaxy Catalog (VAGC) (|Blanton et all [2005). 
This catalog combines the SDSS Data Release 2 with 
a multitude of other publicly available catalogs (2dF, 
2MASS, IRAS PSCz, FIRST, and RC3), and includes 
a variety of derived p arameters including if-corrections 
(Blant on et alJl2003bl) and structural parameters. 

From the VAGC we have selected the two largest con- 
tiguous regions of the SDSS. We will call SDSSl the 
region at a = 190°, S = 50°, which includes ~ 103,000 
galaxies with spectroscopic z < 0.2 and SDSS2 the re- 
gion centered on a = 190°, 6=1° with ~ 87, 000 galaxies 
at z < 0.2. We divide red and blue galaxies in the SDSS 
data at the valley visible in the rest-frame g — r color 
distribution at (g — r) = 0.7. 

In addition to the 'main' SDSS sample there is a sec- 
ondary targeting algorithm designed to identify large 
numbers of luminous red galaxies (LRGs) at moder- 
ate redshifts via photometric color cuts. These LRGs, 
though low in spatial number density, cover an enormous 
volume, and are hence ideal objects for m easuring very 
large scale clustering in the u niverse (see iZehavi et alJ 
120051* lEisenstein et all 120051 for a more detailed de- 
scription of the SDSS LRG sample). The LRG sam- 
ple we study here contains ~ 10, 000 galaxies and spans 
the redshift range 0.16 < z < 0.46 and luminosity range 
-23.2 < M g < -21.8 with (g-r) > 0.7 (note that this is 
the only case where we use SDSS absolute magnitudes). 

6.1.1. Survey Geometry and Completeness 

Since the SDSS is a low-redshift survey, the angular 
size of a test void sphere of the same comoving radius 
varies enormously over the redshift range 0.05 < z < 0.2. 
Paralleling our analysis of the DEEP2 sample, we ac- 
count for geometry and completeness effects in the fol- 
lowing way. We generate an angul ar window function for 
the S DSS with the aid of Mangle l|Hamilton fc Tegmarkl 
120041) over a dense grid in right ascension and declina- 
tion; the resulting resolution is 0.15 degrees in right as- 
cension and declination. Completeness values of either 
or 1 were assigned to each right point of this grid de- 
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pending on the spectroscopic coverage and locations of 
bright stars (as SDSS spectroscopy is highly uniform in 
depth). We then convolve this window function with a 
kernel that represents the depth through the test void 
projected on the sky (see § 15.1.11 for details) and only 
place random spheres in regions above a minimum con- 
volved completeness. This allows us to avoid placing test 
spheres in poorly-sampled regions. We set this threshold 
at 85%, so that the completeness within spheres placed 
down varies over the survey by < 10%. 

The net result of the strong scaling of angular size 
with redshift is that larger-volume test spheres cannot 
be placed in the lowest redshift bins because they would 
span a region of the sky comparable to the entire survey. 
Hence at larger void radii we are restricted to higher 
redshifts. This does not cause a detrimental bias, how- 
ever, because we only consider galaxies over a redshift 
range such that their number density is approximately 
constant, i.e., volume-limited samples. Hence a bias to- 
wards slightly higher redshifts (z ~ 0.1) for larger voids 
will not skew our statistics, assuming that there is not 
strong void statistic evolution from z ~ to z ~ 0.1. 
Indeed no significant evolution has been de tected in the 
VPF out to z ~ 0.3 l|Hovle fc Vogelevll200l . 

There is one final effect in the SDSS data that, if 
not properly treated, could significantly bias our results. 
There is a massive structure i n SDSS2 at z ~ 0.08 
dubbed the "Sloan Great Wall" (|Gott et al.l2005|) . which 
will strongly affect any clustering measurements. This 
structure has been removed in the correlation function 
studies of Ze havi et alJ l)2002j) . and we do the same here 
by defining our samples to avoid 0.075 < z < 0.085 in 
the SDSS2 sample. The structure extends across the 
entire angular extent of SDSS2; if included it would 
cause gross underestimates of the cosmic error in any 
large-scale structure measurement. Based on many real- 
izations of large cosmological simulations with Gaussian 
initial conditions, it is expected that a structure of this 
size occurs in a volume the size of the SPS S approxi- 
mately 10% of the time l|Tegmark et al.ll2004|) . With the 
full SDSS dataset, it should be possible to accurately 
account for this enormous structure in the error budget 
without relying on simulations. 

In order to accurately measure quantities via our 
counts-in-cells approach, it is important that the number 
of randomly placed test spheres exhausts the number of 
independent volumes in the survey. The second data re- 
lease of the SDSS over the interval 0.8 < z < 0.15 spans 
a volume of ~ 10 7 h~ 3 Mpc 3 . This corresponds to ~ 10 6 
independent volumes for 1 h^ 1 Mpc spheres and ~ 10 3 
for 15 hr 1 Mpc spheres, a factor of 10 more than in the 
DEEP 2 survey. The number of spheres we use always 
exceeds the number of independent volumes. 



6.2. Results at 







We now present void statistics in the local universe us- 
ing the SDSS dataset. As in our analysis of DEEP2 data, 
we investigate void statistics for three sets of SDSS sam- 
ples: an "overall" sample with —22 < Mb ~ 51og(h) < 
— 19 and 0.09 < z < 0.14, two subsamples split accord- 
ing to g — r color (at (<? — r) = 0.7), and three subsam- 
ples differing in luminosity. In this section, as before, we 
plot reduced VPFs as differences from the negative bi- 
nomial model in order to clearly show small deviations. 
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Fig. 13. — Reduced VPF for SDSS galaxies, plotted as differences 
from the negative binomial model. Top: The overall, blue, and red 
galaxy samples agree well with the negative binomial model. The 
largest discrepancy is for blue galaxies, although due to the large 
covariance between bins, these errors are underestimates of the true 
error. Bottom: Reduced VPF for SDSS galaxies as a function of 
luminosity. All luminosity classes agree well with the model, with 
the largest deviations seen in the intermediate luminosity sample. 



As for the DEEP2 luminosity subsamples, All compar- 
isons made in this section are made without requiring 
that samples have similar number densities, and by us- 
ing the negative binomial model to predict the VPF (via 
Eqn. I12|) . Errors for all quantities were derived from 
jackknife subsamples. 

The reduced VPFs for the overall, red, and blue sam- 
ples (Fig. El top panel) are all well described by the 
negative binomial model. As in the previous sections, 
we can use this measured agreement to_predict the VPF 
at any radius based solely on N and £ for that radius, 
(e.g. as estimated from counts-in-cells), for each sam- 
ple. Fig. [21 (top panel) plots the VPF for these samples 
and compares them to the predictions from the negative 
binomial model (lines). The trend that the red galaxy 
sample has more and/or larger voids than the blue sam- 
ple is due to the fact that it is both more strongly clus- 
tered and lower in number density than our sample of 
blue galaxies. That the measured data agree with the 
negative binomial model within errors implies that, just 
as at z ~ 1, the VPF cannot be used to uniquely con- 
strain the halo model or cosmological parameters. As 
with the overall DEEP2 VPF, the blue SDSS data do 
not agree within la errors on the largest scales; this is 
likely insignificant, as the data are highly covariant, and 
hence our errors are underestimates of the true error. 

Next, we compare void statistics for samples of SDSS 
galaxies as a function of luminosity. The reduced VPF 
for galaxies in all luminosity bins we consider (—22 < 
M B - 51og(h) < -21, -21 < M B - 51og(h) < -20, and 
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Fig. 14. — Plot of Pq measured for SDSS galaxies, overall and 
as a function of color (top) and luminosity (bottom). The data are 
shown as points, while lines are predictions from the negative bi- 
nomial model for each subsample. The predictions are determined 
from measurements of N and_£ for spheres as a function of radius. 
The agreement implies that N and £ alone determine Pq. 



—20 < Mb — 51og(h) < —19), is in good agreement with 
the negative binomial model (Fig. bottom panel). 
We again make this agreement explicit by computing the 
VPFs for these samples and comparing to that predicted 
by assuming the negative binomial model from the val- 
ues of N and £ (Fig. El bottom panel). Since these sam- 
ples are volume limited, each luminosity sample spans a 
slightly different redshift range, with the brightest span- 
ning the largest range (0.11 < z < 0.20) (since bright 
galaxies can be detected at greatest distances) and the 
faintest spanning the smallest range (0.05 < z < 0.07). 

Finally, we compute the VPF for the LRG popu- 
lation, defined to be bright (-23.2 < M g < -21.8) 
and red ((<? — r) > 0.7), spanning the redshift range 
0.16 < z < 0.46. Their low number density (n ~ 10~ 5 
ft, 3 Mpc~ 3 ) and large redshift range allow for a measure- 
ment of the VPF out to unprecedented scales (R= 40 
h^ 1 Mpc). Fig. 1151 shows that the LRG population is 
well described by the negative binomial model out to the 
largest void radii tested. Unlike our analysis of other 
SDSS data, we plot the VPF separately for the two re- 
gions used in this study (denoted SDSSl and SDSS2). 
The difference between the two regions reflects the im- 
pact of cosmic variance on void statistics. 

The reduced VPFs of all galaxy populations explored 
at z ~ are in good agreement with the negative bino- 
mial model. It is quite surprising that this agreement, 
which provides a simple mapping from TV and £ to Po, 
is equally valid from z ~ 1 to z ~ 0. What this tells 
us is that the evolution in the VPF for a given popu- 
lation from z ~ 1 to z ~ must proceed in lockstcp 
with evolution in the number density and two-point cor- 
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Fig. 15. — Measured Po for the SDSS LRG population over 
the redshift range 0.16 < z < 0.46. Unlike for the other SDSS 
data samples, here we are able to measure Po to much larger radii 
and therefore show results separately for each of the two SDSS 
regions on the sky, denoted SDSSl and SDSS2 (see JCTJ. The 
difference between the two regions reflects the cosmic variance of 
their volumes. As before, we show jackknife error estimates. The 
negative binomial predictions (lines) match the measured Po values 
well out to void radii of 40 h~ 1 Mpc. 



relation function of that population. Finally, it is en- 
couraging that the se conclusions at z ~ agree with 
iCroton et al.l l)2004|) , who measure void statistics for the 
low-redshift 2dF survey, and find good agreement be- 
tween the galaxy reduced VPF and the negative binomial 
model for a range of galaxy luminosities. 

7. DISCUSSION 

We have presented measurements of the void probabil- 
ity function (VPF) and reduced VPF for galaxies as a 
function of color and luminosity at z ~ 1 using DEEP2 
data and at z ~ with SDSS data. We find that all sam- 
ples are well described by the negative binomial model. 
This agreement implies that the VPF for a given sam- 
ple is determined entirely by the sample number density 
and volume-averaged two-point correlation function, £. 
In particular, evolution of the VPF for a population of 
galaxies from z ~ 1 to z ~ is governed by the evolution 
in £ and N for that population. We have furthermore 
shown, using mock catalogs, that this simple relation be- 
tween the VPF and £ holds for a wide range of halo 
models in redshift space, but breaks down in real space. 
We now discuss the relevance and implications of these 
findings. 

The VPF we measure at z ~ 1 is in good agreement 
with the VPF measured in mock galaxy catalogs cre- 
ated from ACDM simulations. Although this is encour- 
aging, our result that the two-point correlation function 
and number density determine the VPF implies that this 
agreement found between data and simulations is simply 
a consequence of the fact that the simulations were con- 
structed to match the number density and two-point cor- 
relation function of DEEP2 galaxies. Stated differently, 
the VPF currently cannot provide new constraints cither 
on cosmological parameters or on the method in which 
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galaxies are placed into dark matter N-body simulations. 

Previous void studies have interpreted the VPF ac- 
cording to its mathematical expansion as an infinite sum 
of higher-order correlation functions, using the hierarchi- 
cal Ansatz to relate the higher-order functions to lower- 
order functions. These studies then attempt to use the 
reduced VPF to test the validity of this Ansatz. Strictly 
speaking, however, finding that the VPF for various 
populations of galaxies can be characterized entirely by 
N and £ does not provide evidence for the validity of 
this Ansatz. In fact, the interpretation of the VPF in 
terms of higher-order correlation functions implies that 
the reduced VPF depends only on the scaling coefficients 
(S p = £p/£ p_1 ), which suggests that populations with 
different biases (and therefore different scalings between 
the three-point and two-point functions) should actually 
not have the same reduced VPF. We find just the oppo- 
site in the data, namely that populations with different 
biases have the same reduced VPF. The interpretation of 
the VPF as a sum of correlation functions affords little 
insight into these results. Despite this, the interpreta- 
tion predicts that the reduced VPF of a sample should 
be independent of that samples number density, and this 
is indeed observed. 

The halo model provides a more tractable theoretical 
framework. We have shown that the reduced VPF of 
mock galaxies is quite insensitive to particular param- 
eters of the HOD when voids are measured in redshift 
space. Using measurements of £ to constrain the halo 
model, it appear s that blue and red galaxies have very 
different HODs ijZehavi et al J 120 053). The samples we 
investigate here hence likely have a range of HODs. Yet, 
based on the result that different HODs generate the 
same reduced VPF, it is not surprising that different 
samples have the same reduced VPF. 

Furthermore, an analysis of the dark matter halo re- 
duced VPF has led us to conclude that it is the redshift- 
space distribution of halos themselves that is primarily 
responsible for the agreement between the measured re- 
duced VPF and the negative binomial model. The re- 
duced VPF changes very little when one populates dark 
matter halos with galaxies. It is only surprising that the 
minimum halo mass, M m i ni has very little effect on the 
reduced VPF as well. This insensitivity to M min im- 
plies that the changes in £ caused by changes in M m i n 
are enough to completely account for the changes in the 
VPF. 

We would like to stress that the strict dependence of 
the VPF on iV£ does not rely on any theoretical inter- 
pretation, including the hierarchical Ansatz, and, in par- 
ticular, does not depend on the reduced VPF following 
the negative binomial model. Our result that all galaxy 
populations studied here have consistent reduced VPFs 
immediately implies that the VPF can be entirely de- 
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scribed by 7V£. This, in turn, implies that the VPF is 
currently incapable of uniquely providing constraints on 
either cosmological parameters or particular aspects of 
the halo model, as any useful information provided by the 
VPF is likewise provided by two-point correlation func- 
tion analyses; the VPF is of Little use in understanding 
the large scale structure of the universe. 
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